A Layered Grammar Model: Using Tree-Adjoining Grammars to Build a 
Common Syntactic Kernel for Related Dialects 
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Abstract 

This article describes the design of a com- 
mon syntactic description for the core 
grammar of a group of related dialects. 
The common description does not rely on 
an abstract sub-linguistic structure like a 
metagrammar: it consists in a single FS- 
LTAG where the actual specific language 
is included as one of the attributes in the 
set of attribute types defined for the fea- 
tures. When the Ian attribute is instanti- 
ated, the selected subset of the grammar is 
equivalent to the grammar of one dialect. 
When it is not, we have a model of a hy- 
brid multidialectal linguistic system. This 
principle is used for a group of Creole lan- 
guages of the West-Atlantic area, namely 
the French-based Creoles of Haiti, Guade- 
loupe, Martinique and French Guiana. 

1 Introduction 

Some of our present research aims at building for- 
mal linguistic descriptions for regional languages 
of the area of the Lesser Antilles and the Guianas, 
most of which are so-called "under-resourced lan- 
guages". We have concentrated our efforts on 
a specific group of languages, the French-based 
(or French-lexified) Creole languages of the West- 
Atlantic area. We are concerned with provid- 
ing users of those languages with electronic lan- 
guage resources, including formal grammars fit to 
be used for various Natural Language Processing 
(NLP) tasks, such as parsing or generation. 

We are developing formal grammars in the 
TAG (Tree- Adjoining Grammars) framework, the 



tree-centered unification-based syntactic formal- 
ism which has proven successful in modelling 
other languages of different types. TAG gram- 
mars may be lexicalized, so they provide a lexicon- 
centered description of phrase constructions 
(Schabes et al, 1988| ); and have been equipped 
with the formal tool of double-plane feature struc- 
tures, allowing the concept of feature structures 
unification to get adapted to the specific needs of 



adjunction ( Vijay-Shanker and Joshi, 1988 I 



In the context we are working in, two practi- 
cal reasons are leading to the search of solutions 
for factoring as much as possible of the gram- 
mars of those languages: first, the languages in 
this group are fairly close to one another, with re- 
spect to both lexicon and grammar; second, the re- 
sources dedicated to their description are scarce. 
The close relatedness makes it obvious for the lin- 
guist to try to leverage the efforts spent on describ- 
ing the grammar of one of the languages, by fac- 
toring out all the common parts of the grammatical 
systems. This principle has been used by other re- 
search work (see below. Section |4l). 



The originality of our approach is that we de- 
lay the point at which a single language is actu- 
ally chosen to the very last moment, namely at 
generation time (the same would apply to parsing 
time, but parsing has not been implemented yet). 
In the end, we propose a grammar which is not a 
grammar for one single dialect, but a grammar for 
a multidialectal complex, where language is one 
of the features selected in the grammar itself, like 
person, number, tense, or aspect. 



2 Coverage of the grammar 

The portion of the grammar described so far rep- 
resents only a small fragment of the grammar of 
the languages we are interested in. Until now, we 
have made attempts to describe: the determina- 
tion of noun phrases; the system of personal pro- 
nouns and determiners; the core system of expres- 
sion of tense, mood and aspect (TMA) of verbs — 
or, to put it more cautiously, of predicates — ; the 
main auxiliary verbs used to express other aspec- 
tual nuances; the expression of epistemic and de- 
ontic modality; the combination of the negation 
with the above mentioned subsystems (tense, as- 
pect, modality) in the predicative phrase. 

The grammar and lexicon files are built upon 
an ad-hoc implementation of FS-TAGs in Pro- 
log, which had originally been developed in 
another context and for another language, Ger- 



man ( [Vaillant, 1999[ ), and later adapted to Martini- 
can Creole d Vaillant, 20031 ) . 

The only function implemented at present is 
sentence generation; the starting point of the gen- 
eration is a conceptual graph, expressed by a min- 
imal set of spaning trees, which in turn select ele- 
mentary trees in the grammar (initial trees for the 
first pass, auxiliary trees for the remaining parts). 
We are testing our grammar on a small sample tests 
of such conceptual graphs. 

In the remainder of this article, we will focus 
the attention on two typical core subsystems of 
the grammar: determination in the noun phrase, 
and expression of tense and aspect in the predicate 
phrascl 

3 AppUcation to French-based Creoles 

The family of dialects to which we apply the ap- 
proach described is the family of French-based 
(sometimes called French-lexified) Creole lan- 
guages of the West- Atlantic area. Those languages 
emerged during the peak period of the slave trade 
epoch (1650-1800) when France, like some other 



'Precisely: SWI-Prolog, developed and main- 
tained by Jan Wielemaker, University of Amsterdam: 

http : / /www . swi -prolog .org. 

^It may be inadequate to speak of verb phrase in the case 
of the Creole languages mentioned here, since any lexical unit 
(including nouns, but also some closed-class units like loca- 
tive adverbs) may be inserted in the predicate slot of a sen- 
tence and bear tense or aspectual marks. So there probably are 
verbs, but possibly no "verb phrases" — see ( [Vaillant, 2003| l 
for a discussion. 



West-European nations, founded colonies in the 
New World and tried to develop intensive agri- 
cultural economic systems based on the exploita- 
tion of slave workforce massively imported from 
Africa. In the quickly developing new societies, 
at any given moment during that peak period, the 
number of people recently imported in any colony 
tended to be higher than the number of people ac- 
tually bom there — a typical situation for Unguis- 
tic instability. Moreover, the slaves were brought 
from different regions of Africa and had no com- 
mon language to communicate with, except the 
language of the European colons: so they were 
forced to use that target language, without having 
time to learn it fully before passing it on to the next 
generation of immigrants. This type of situation 
leads to a very specific drift of the language sys- 
tem, which begins to stabilize only when the soci- 
ety itself stabilizes. When observed in synchronic- 
ity at the present moment, those Creoles obviously 
appear as languages which share a very great por- 
tion of their vocabulary with French (more than 
90 %), but have a very specific grammatical sys- 
tem, quite different from the French one. 

The languages falling into the category com- 
prise French Creole dialects born and developed 
in former French colonies of the Caribbean Arc 
and its two continental "pillars": from the present 
US State of Louisiantdto French Guiana (formerly 
the Cayenne colony), on the northern coast of 
the South-American mainland. Caribbean islands 
where a French Creole has developed include His- 
paniola (in the western part of the island, the 
former French colony of Saint-Domingue, since 
1804 the independant republic of Haiti), Guade- 
loupe, the island of Dominica, Martinique, Saint- 
Lucia, and Trinidad (the latter also nearly ex- 
tinct). Among the languages listed, we leave apart, 
for lack of easily accessible sources and infor- 
mants, the case of Louisiana, Dominica, Saint- 
Lucia and Trinidad, and concentrate on the four 
Creoles of Haiti, Guadeloupe, Martinique and 
French Guiana. 

The question of how properly those languages 
qualify as a genetically related family has been dis- 
cussed in the literature. A starting point would be 



A nearly extinct French Creole dialect — not to be con- 
fused with Cajun French — is still understood by some peo- 
ple in the parishes of Saint-Martin, Iberville and Pointe- 
Coupee. 



the obvious statement that all of them have French 
as an ancestofl but this is not of much linguis- 
tic interest since, as we have seen, the related- 
ness with French lies principally in the vocabu- 
lary, whereas the Creole dialects have a great con- 
vergence in their grammatical systems, that they 
precisely do not owe to French. Some formerly 
proposed theories of monogenesis of all Creole 
languages are now largely out of fashion; how- 
ever, if the question is restricted to monogene- 
sis of a specific group of Creoles (e.g. French- 
based, or English-based) in a specific region of the 
world (e.g. the West-Atlantic area), monogenesis 
in this restricted acceptation remains a seriously 
discussed hypothesis. In any case, it has been es- 
tablished from historical sources that there was un- 
interrupted contact and interchange between the 
French colonies, from the first decades of colo- 
nization up to now, so that it is a safe bet to con- 
sider the different French Creole dialects as be- 
longing to a dialect continuum. Pfander (120001 
p. 192-209), notably, proposes an analysis of the 
family in terms of dialectal area, opposing center 
(Antilles) and periphery (Louisiana and Guiana), 
and gives comparison tables for the systems of ex- 
pression of tense and aspect. 

For a more detailed presentation of those 
languages, of their history, and of the dis- 
cussions they involve, the reader familiar 
with the French language may easily ac- 
cess ( [Hazael-Massieux, 2002| ). 

We will not enter into a detailed presentation 
of the grammatical systems of the Creoles. The 
most important thing to say here is that they are 
isolating languages, SVO ordered, with a strict po- 
sitional syntax, and that tense and aspect are ex- 
pressed by particles that are placed before the main 
predicate. As said above (Section |2ll, we will con- 
centrate on the noun phrase and on the TMA core 
system within the predicate phrase. Tables [T] and |2] 
give an overview of those two systems. They have 
been compiled from different sources (most partic- 
ularly dPfander, 2000] ) and dDamoiseau, 2007| ) for 
the comparative perspective, but also various other 
references for precise description points specific 
to some given language), and completed following 



our own observations on recent corpora. 

3.1 Determination in the noun phrase 

The four Creoles all possess four systematic de- 
grees of determination of nouns: a generic, an 
indefinite, a specific, and a demonstrative. The 
generic is used when the concept is taken for its 
general features as a category; in English, the same 
meaning could sometimes be expressed with a sin- 
gular, and sometimes with a plural (zwazo gen de 
zel (halt.): the bird has two wings / a bird has 
two wings / birds have two wings). For the sake 
of descriptive economy, in the formalization, we 
treat this generic degree as simply being one of 
the possible semantic values of the plural indef- 
inite (which is also expressed by the bare noun, 
with no articlejl. The indefinite degree, like in 
French or German, is expressed by a numeral (and 
its value is more specific, closer to the original 
semantics of the numeral, than it has become in 
French, for instance — where the indefinite arti- 
cle also is used to express the generic). The spe- 
cific degree (roughly equivalent to English "the") 
is expressed by a postposed article, historically de- 
riving from a French deictic adverb (/a). Lastly, 
the demonstrative degree derives from the com- 
bination of a former demonstrative pronoun, now 
sometimes preposed (guia.) and sometimes post- 
posed (other Creoles) to the noun, and to which the 
mark of the specific definite is added (with a case 
of fused form for Guadeloupean and Martinican). 

The plural is expressed either by a preposed 
marker derived from a former plural demonstra- 
tive (mart., guad.), or by a postposed third-person 
plural personal pronoun (halt., guia.), which in the 
case of guianese got fused with the definite mark 
(ye la [historical form, described in 1872] > ya 
[contemporary form]). 

In our formal model, we only keep three degrees 
of determination (indefinite, specific and demon- 
strative), which combine with two values for num- 
ber (singular and plural). Also, since the indefi- 
nite mark does not combine with the others (when 
in contrast, there is a combination between the 
marks of demonstrative and specific, with demon- 
strative =^ specific), we model the indefinite 



''The atypical mode of language transmission has led 
some historical linguists (Thomason and Kaufman, 1988 
p. 152) to refuse to apply the term of genetic transmission, 
but this point has been thoroughly criticized ( |DeGraff 72005 ( . 



^This interpretation agrees with a number of linguistic 
facts, like anaphora often involving a plural pronoun (zwazo 
gen de zel pou yo kapab vole: bird[s] have two wing[s] for 
them [to be] able [to] fly). 





hait. 


guad. 


mart. 


guia. 


english 


Generic 


moun 


moun 


moun 


moun 


person (human) 


Singular 


indefinite 




Lfll lILLfLllL 


nil IllLfLtIL 


IL/Llft linylrltl 




specific 


tflOUfl ficin 


moun la 


moun Ian 


moun an 


the person 


tab la 


tab la 


tab la 


tab a 


the table 


chyen an 


chyen la 


chyen an 


chyen an 


the dog 


zwazo a 


zozyo la 


zwezo a 


zozo a 


the bird 


demonstrative 


moun sa a 


moun lasa 


moun tala 


sa moun an 


that person 


tab sa a 


tab lasa 


tab tala 


sa tab a 


that table 


Plural 


indefinite 


III /~ttlVt 


III /~ttlVt 


117 

If IL/Llll 




npnnl p 


specific 


moun yo 


se moun la 


se moun Ian 


moun yan 


the persons 


tab yo 


se tab la 


se tab la 


tab ya 


the tables 


chyen yo 


se chyen la 


se chyen an 


chyen yan 


the dogs 


zwazo yo 


se zozyo la 


se zwezo a 


zozo ya 


the birds 


demonstrative 


moun sa yo 


se moun lasa 


se moun tala 


sa moun yan 


those people 


tab sa yo 


se tab lasa 


se tab tala 


sa tab ya 


those tables 



Table 1 : Determination in the noun phrase 



by an absence of determination feature; the spe- 
cific is modeled by the feature (spe = +); and 
the demonstrative by the combination of features 
(spe = +), (dem = +) . 

In some dialects, a phenomenon of nasal pro- 
gressive assimilation changes the surface form of 
the postposed specific article (hait., mart., guia.); 
in others, in addition, the surface form of the ar- 
ticle differs depending on whether the preceding 
word ends with a vowel or a consonant (hait., 
mart.). The four possible combinations aie shown 
in tabled! 

3.2 Tense and aspect in the predicative 
phrase 

In Creole linguistics, a classical description given 
of the TMA (Tense-Mood-Aspect) system of the 
"Atlantic" Creole language^ mentions three op- 
tional components appearing in a very strict or- 
der: past tense mark; "mood" mark (able to take 
future or irrealis values, depending on contexts); 
imperfective aspect mark. A canonical version of 
this system has been given for French-based Cre- 
oles by Valdman (1978i ). who actually describes 
those three categories as one category of tense 
(past) and two categories of aspect (prospective 
and continuative). The "middle" mark (Valdman's 
"prospective") takes on an irrealis meaning when 
it is combined with the past tense. 

So, there is a combinatory system: (te/0) x 
(ke/ 0) X (ka 1 0) (if we call the three marks by the 
form they have in the three Creoles of Guadeloupe, 

*The schema also holds for English-based Cre- 
oles ( [Bickerton, 198 It - 



Martinique and Guyane), which in theory gener- 
ates eight possible combinations: 0, ka, ke, ke ka, 
te, te ka, te ke, te ke ka. The eight combinations 
are attested to different degrees, with the seman- 
tic values given in table |2l In Haitian Creole, the 
corresponding forms are te, va and ap, and some 
combinations yield fused forms (va ap > vap; te 
ap > tap; te va > ta; te va ap > ta vap). 

In fact, there are variations in this basic schema. 
For instance, the term "imperfective" covers a 
complex of diverse meanings (progressive, fre- 
quentative, or simply unaccomplished) which do 
not strictly overlap in the different dialects. For 
instance, if the mark ka may bear all the above- 
mentioned meanings in the Creoles of Guade- 
loupe or Martinique (up to some general temporal 
value roughly corresponding to the English simple 
present), it is not necessarily so in the Creole of 
Guiana, and it is quite false for the Creole of Haiti 
(where the unaccomplished is unmarked, and the 
only aspectual value of particle ap is the progres- 
sive, corresponding not to English simple present, 
but to English BE -i- -ing — and even able to take 
over the temporal value of a future). Table|2]shows 
these differences. 

Lastly, it is important to notice that the combi- 
nations of the TMA marks are constrained by the 
semantics of the unit placed in the predicate posi- 
tion. For instance, a verb with a "non-processual" 
meaning (like konet, to know), or an adjective re- 
ferring to a state (like malad, ill), will hardly com- 
bine with an imperfective aspect marker like ka; 
if they do, however, it will necessarily produce a 
meaning effect that will shift the contextual mean- 





hait. 


guad. 


mart. 


guia. 


A .. , — : : 

Accomplished / Aoristic 


danse 


danse 


danse 


danse 


Unaccomplished / Present 


dcinse 


ka danse 


ha danse 


(ha) danse 


Frecjuentative 


dciHse 


ha danse 


ha danse 


ka danse 


Progres sive 


cip dcinse 


ha danse 


ha danse 


ka danse 


Near Future 


pral danse 


hay danse 


hay danse 


k' ale / hay danse 


Future 


vd dcinse 


he danse 


he danse 


he danse 


Unaccomplished Future (seldom) 


vap danse 


he ha danse 


he ha danse 


he ha danse 


Accomplished past (pluperfect) 


te danse 


te danse 


te danse 


te danse 


Unaccomplished past 


tap danse 


te ha danse 


te ha danse 


te ha danse 


Irrealis 


ta danse 


te he danse 


te he danse 


te he danse 


Irrealis unaccomplished 


ta vap danse 


te he ha danse 


te he ha danse 


te he ha danse 


Conditional / Optative 


ta danse 


te he danse 


se danse 


te he danse 



Table 2: Core tense and aspect marking in the predicative phrase 



ing towards a less "stative" value. For example, an 
utterance like mo ka malad (I-lMP-ill) might be at- 
tested; and it is to be interpreted, depending on the 
context, either as a frequentative (at every back to 
school time, I get flu), or as a progressive (I feel I 
am coming down to flu). 

3.3 Some TAG model elements 

In figures [T] and [2j we show the main components 
of the model for the noun phrase system presented 
in table [T] represented as elementary trees with a 
language parameter /0. 

It should be noted that the trees Dem Det 
(gp,mq) and Plur (gp,mq), which concern only two 
dialects among the four (Guadeloupean and Mar- 
tinican), are included in the common layer with- 
out risking to interfere with the construction of the 
demonstrative or plural in Haitian or Guianese (in 
fact, unification constraints forbid the adjunction 
of a GP/MQ demonstrative on a HT/GF demon- 
strative; likewise, they forbid the adjunction of a 
GP/MQ plural on a HT/GF plural). 

The adjunction of the demonstrative in Haitian 
or Guianese is done above the level of the noun 
complements (attention to parameter bar in the 
trees Dem (gf) et Dem (ht)), but below the spe- 
cific article; e.g. moun Sentoma sa yo (hait.): those 
people from Saint-Thomas; sa moun Senloran an 



^The following abbreviations are used for the attributes: 
bar = bar level (1 = noun with complements, but no determi- 
nation; 2 = noun phrase); nbr = number; spe = specific deter- 
miner; dem = demonstrative determiner; ens = the constituent 
ends with a consonant; nas = the constituent ends with a nasal 
syllable; Ian = language. The values used to identify the four 
Creoles are based on the two-letter country codes defined in 
standard ISO-3166 for country names: HT for Haiti, GP for 
Guadeloupe, MQ for Martinique, and GF for French Guiana 
(going from North to South... and by decreasing population 
count.) Non-instantiated variables are in italics. 



<spe =-> 
N <dem = -> 
<cns =c> 
<nas =«> 
<lan = / > 



(c.n.l) 



N with no compi. 



Det 



Ind ^ 0) 



<bar = 2> 
<spe =-> 
<dem =-> 
<nbr = sing> 
. <lan = / > 

<bar = 1 > 
<spe =-> 
<dem =-> 
<lan = / > 



Ind Det sg (*} 



<bar =2> 
<spe =+> 
<dem =d> 
<nbr=.Y> 
v.^<lan = / > 



Det 



<bar = 1 > 
<dem =d> 
<cns =c> 
<nas =n> 
<ian = / > 



Spe ^ (x.cnj) 




Spe Det(*) 



Det 



Plur „ (I) 



<bar =2> 
]Sj <spe=+> 



<bar =2> 
<spe =+> 
<nbr = sing> 
<lan = / > 



Dem Det (gp.mq) 



Plur (gp.mq) 



Figure 1: Common elements in the NP model. 



Det 



Dem „ (gf) 



<bar = 1 > 
<dem =+> 
<nbr = sing > 
<cns=c> 
<nas =/f> 
.^laii =gf> 



<bar = 1 > 
<dem = - > 
<cns = c> 
<nas = n> 
<lan =gf> 



<bar = 1> 
<dem=+> 
<nbr = sing > 
,<;lan = ht > 

Det 



<bar = 1 > 
<dem = - > 
<lan =ht > 



Dem (ht) 



Dem (gf) 



Dem (ht) 



Figure 2: NP modelling elements specific to 
Haitian and guianese 



(guia.): those people from Saint-Laurent. 

The TMA system, on its side, is in a great part 
common to the four languages. Auxiliary trees 
modelling the adjunction of aspectual or tempo- 
ral values hence are all common (fig.©. The only 
nuance resides in the fact that the tree for adjoin- 
ing an aspect particle to convey general values of 
imperfective (durative, frequentative) cannot unify 
when the Ian parameter is set to Haitian. In the 
end, only the lexical (surfaceWalues make the dif- 
ferences between the dialect^- 



Pred 



Pred 



<bar=2> 
<lan = / > 



<pi-g=-> 
<imp =-> 
<psp=-> 
<pas = -> 
<lan = !> 



Asp 



Imp 



Pred * 
<prg=-> 
<imp = -> 
<psp = -> 
<pas = -> 
(I) <lan=/> 



Inlr. V(*) 



Pred 



Progressive ( 



Pred 



<psp = 



Asp 



Pred* 
<pig = -> 
<imp =-> 
<psp = -> 
<pas = -> 



Asp 



Pred* 

<psp=-> 
<pas = -> 
<lan = /> 



Imp ^ CO <lan=( = gplmqlgf > Prosp^f/J 
Imperfective (general) (gp,mq,gf) Prospective (*) 



Pred 



Pred 



<pas = 



Asp 



Pred* 

<psp =-> 
<pas = -> 
<lan = / > 



Tern 



Pred* 



<pas = -> 
<lan = / > 



Prox^fZ) 

Near Fixture (*) 



Pas ^ (I) 
Fasti*} 



Figure 3: Common elements in the predicative 
phrase model 



4 Related work 

The idea of factoring some of the efforts of 
grammar modelling to exploit similar structures 
among different languages has already been 
tackled by some research works, among which 
we are particularly aware of those led at Jussieu 
within 
the 



the FTAG project 
Lexorg project 



(Candito, 1998), 



(XiaetaL, 19981, the 



The following abbreviations are used for the attributes in 
fig. [3] Tense: pas = past; Aspects: psp = prospective; prx = 
proximal prospective ("imminent" aspect ~ temporal value of 
a near future) ; imp = imperfective (general) ; prg = progres- 
sive (like in English "I am doing..."). 



LinGO grammar Matrix 


(Bender etal, 2002 


Bender and Flickinger, 2005 [ 


1 the LORIA XMG 


project (Parmentier et al., 2006"), and Bouillon et 



al.'s work (I2006II on multilingual multipurpose 
grammar 

Works like Candito's ( fT998l) (for French and 
Italian) or Xia and Palmer's (1998) (for English 
and Chinese), are based on the idea of using meta- 
grammars, that is higher-level descriptions of gen- 
eral properties of the language(s) described. The 
higher-level descriptions for different languages 
may be factored as long as the languages share 
typological features. In the end, an actual LTAG 
grammar is generated from the meta-grammar, tai- 
lored for one specific language. In this type of ap- 
proaches, what is actually shared between the lan- 
guages is a higher-level structure, not actual gram- 
matical structures belonging to the LTAG descrip- 
tion of the languages. 

In the LinGO grammar matrix ap- 
proach ( [Bender et al., 2002] ), underspecified 
HPSG structures (with a minimal recursion 
semantics) are used to share information between 
different languages. A system based on shell 
scripts is used to automatically generate grammar 
files for a specific language, when given a couple 
of general typological specific information (word 
order pattern, case marking strategy, etc.). 

The approach which most resembles the one 
advocated in the present paper is Bouillon et 
al.'s (12006b way of devising quickly re-usable 
grammars for speech recognition programs, based 
on shared grammatical descriptions for related ro- 
mance languages (French, Castilian Spanish, and 
Catalan). The authors include "macros" in their 
DCG-style upper-level description, and the macros 
allow to specify alternative points where the lan- 
guages differ (like the position of cUtics in specific 
verb forms, the optionality of determiners, the op- 
tional presence of prepositions for object comple- 
ments, etc.). In a last stage, the DCG-style spec- 
ification is compiled to ad hoc CFGs tailored for 
speech recognition engines, each for a specific lan- 
guage and task. 

Our approach, in contrast, is not a meta- 
grammar approach; what is shared between the 



See LinGO grammar matrix' 



web 



site: 



http : / / www . delph-in . net / matrix/. 

'"Thanks to the reviewers of the preliminary version of this 
article for pointing to some useful references. 



different languages are actual LTAG trees. The 
"language" parameter is embedded in the very fea- 
ture structures of tree nodes. So, our lexical- 
grammatical descriptions reside in one single level 
of description, but that level is "modularized": 
some descriptions are common to all the dialects 
described, some are shared by only part of them, 
and some are specific. In other works, even those 
which are not based on meta-grammars (like Ben- 
der's or Bouillon's), the goal is to generate a gram- 
mar for a single language in the end. In the present 
work we are aiming at giving a description of a 
multidialectal linguistic system. 

5 Discussion 

The above-mentioned modelling choice may seem 
counter-intuitive in the theoretical frame of struc- 
tural linguistics. One might object that if the lan- 
guage itself is the whole object of description, then 
it is absurd to include it as a category in the de- 
scription. This view is justified as long as one does 
not wish to take into account dialectal variation as 
an internal system variable. If this is the case, then 
every single dialect must be considered an isolate 
and be given a holistic, unitary description. 

But in the context we are working in, several ra- 
tionales lead us to think that it might be a good idea 
to include dialectal variation in the description. 

We already have mentioned practical reasons 
(see above, in Introduction). The "time saving" 
and "resource sharing" rationales applies to our 
method as well as to others (like meta-grammars). 
A supplementary argument, which applies more 
specifically to our method, is the fact that in the 
cases we are studying, not only some syntactical 
properties of the languages are common, but also 
an important part of the vocabulary, until at the 
very surface level. This speaks for sharing bottom- 
level structures. 

But there is another, less practical, type of argu- 
ment: if we have a modular grammatical system 
model which "contains" more than one language 
in itself, we are able to model the linguistic com- 
petence in one of the languages, but also to model 
multilingual (in the present case, multidialectal) 
linguistic competence. 

If our goal is to model monolingual compe- 
tence, this is easily done by unifying the Ian pa- 
rameter with one of its possible values, and then 
erasing the (now redundant) parameter from the 



description. 

However, in some cases, we might want to have 
a model of multidialectal variation. Considered 
from the E-language side, we then have a model 
of a dialectal continuum. Considered from the I- 
language side, we have a model of the linguistic 
competence of a multilingual speaker of related di- 
alects. The interplay of grammatical structures of 
a multidialectal system, the possibilities of com- 
bination and unification given different levels of 
instantiation of the Ian parameter, might provide 
us with a model for such linguistic phenomena as: 
specialized repertoires, code switching, code mix- 
ing, or koine emergence. That work, at the present 
stage, is still to be done: it is a mere idea of fu- 
ture research directions to evaluate the potential 
of our modelling method. Yet it is an appealing 
idea, given that in some types of contexts, multi- 
linguality among related dialects is a common sit- 
uatioio, and that phenomena such as code switch- 
ing or code mixing are more frequent than the op- 
posite — the use of a single unitary language with 
a single nor nS It is also a matter of future re- 
search to evaluate the degree of pai^sing feasibility 
for mixed linguistic input. 
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